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Median statistics and the Hubble constant 

Gang Chen^ and Bharat Ratra^ 

ABSTRACT 

Following Gott et al. (2001), we use Huchra's final compilation of 553 measurements 
of the Hubble constant (Hq) to determine median statistical constraints on Hq. We 
find Hq = 68 lb 5.5 (or ±1) kms~"'^Mpc~^, where the errors are the 95% statistical 
and systematic (or statistical) errors. With about two-third more measurements, these 
results are close to what Gott et al. found a decade ago, with smaller statistical errors 
and similar systematic errors. 

Subject headings: cosmology: observation — methods: statistical — methods: data 
analysis — cosmology: distance scale — large-scale structure of the universe 

1. Introduction 

The long and involved history of increasingly more accurate and precise measurements of the 
Hubble constant has resulted in an extensive list of more than 550 Hq values recorded by Huchrall] 
Rowan-Robinson (2009) notes that most recent (central) estimates of Hq lie in the range of 62 to 
72 kms~^Mpc~^, although individual estimates can differ amongst themselves by 2 or 3 standard 
deviations (for recent reviews see Jackson 2007; Tammann et al. 2008; Freedman & Madore 2010). 
While this is unfortunate, and perhaps not unexpected, Huchra's extensive compilation may be used 
to derive a more precise meta-estimate of Hq that is more robust than any individual estimate. 

We follow Gott et al. (2001, hereafter GOl) and use median statistics to determine what 
Huchra's Hq central values alone (i.e., ignoring the quoted errors) tell us about the true value and 
uncertainty of the Hubble constantly With about two-third more data than GOl (553 measurements 
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versus 331), we confirm and strengthen the results of GOlO We also examine how the estimated 
value of Hq changes as we consider different subsamples of the complete list, and argue that the 
estimate from the complete list is a robust estimate of the Hubble constant. 

Our paper is organized as follows. We first review some basic median statistics concepts from 
GOl in the following section. In Sec. 3 these are applied in an analysis of Huchra's Hq list, where 
we also discuss some consistency tests and give constraints on the Hubble constant. We conclude 
in Sec. 4. 



2. Median statistics and errors 

Compared to a analysis, a median statistics analysis requires fewer hypotheses and is much 
less sensitive to being biased by outliers. See GOl for a comprehensive introduction to median 
statistics and its applications Here we restate the basic idea from GOl and emphasize some key 
points that are relevant to our analysis. 

The basic idea of median statistics is that the true value of a physical quantity is the median 
of the set of (error-affected) measurements. This is based on the assumption that the data set 
meets two statistical requirements: (1) all the measurements are independent; and, (2) there is 
no (overall) systematic error for the whole data set as a group. In other words, as the number of 
independent measurements goes to infinity, the median will converge to the true value. The median 
does not depend on the measurement errors (GOl). 

Consider a data set consisting of N measurements for a quantity that meets the two require- 
ments above. Sort the measurements from the lowest value to the highest and label them Mj 
respectively, where i = 1,...,N. Then the probability that the true value for the quantity lies 
between Mj and Mj+i is 

2-^A^! 
' ~ i\{N-i)V 

where we set Mq = — oo and Mn+i = +oo (GOl). The range from Mj to Mn+i-j (where j < N/2) 
defines a confidence limit (hereafter c.l.) of Cj percent where 

Cj = 100 X {Pj + Pj+i + ... + PN_j). (2) 

The Cj's are a finite number of discrete values, with the number depending on A^. So for any 
confidence limit commonly used, for example, the 95% c.l., we take the c.l. corresponding to the 



■^For an analysis of an intermediate version of Huchra's list with 461 measurements, see the Appendix of Chen et 
al. (2003). Chen et al. (2003) did not estimate systematic errors bars; instead they used the earlier systematic error 
estimate of GOl. In this paper we estimate systematic error bars for the new list of Ho measurements. 

*For further applications and more recent developments see Podariu et al. (2001), Avelino et al. (2002), Chen & 
Ratra (2003), Sereno (2003), Bentivegna et al. (2004), White et al. (2007), Richards et al. (2009), and Shafieloo et 
al. (2010). 
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Cj which is the smahest among those larger than 95 (GOl). These confidence hmits do not depend 
on the measurement errors (GOl). 

Note that the systematic error in the second requirement above is different from the individual 
systematic error quoted as part of the error for an individual measurement. If the systematic errors 
for the individual measurements are not correlated, we can treat them as random errors when 
combining individual measurements of a whole data set, as discussed in GOl and below, and the 
total error can be estimated by studying the histogram of the whole data set, without going in to 
details of the error analysis. But if all measurements are affected by the same systematic shift, 
i.e., there is a systematic error at the whole data set level, a median statistics analysis will give an 
incorrect result. This is unlikely to be an issue for the Hq data (GOl). The intermediate case is that 
a subgroup of data share a similar individual systematic error. Here we use 'subgroup systematic' 
error to denote the part of individual systematic errors that are common to all measurements within 
the subgroup. 

Subgroup systematic error is likely the main reason that the first requirement above (statistical 
independence) is not satisfied. One estimate of the error contribution from this effect may be derived 
by dividing the N measurements into subgroups that belong to different measurement techniques 
(measurements in each such subgroup could very likely be affected by similar systematic effects), 
and then studying the differences between results from each subgroup (GOl). 

3. Application to Huchra's Hq list 

3.1. Huchra's Hq list 

The final version of Huchra's Hubble constant measurements list, updated on October 7, 2010, 
contains 553 published estimates (rounded to the nearest 1 kms~^Mpc~^), some as recent as 
September 2010. All but three of them come with error bars. Most include both statistical and 
systematical errors, although a few have only statistical errors. In this paper we use only the 
quoted central Hq value, and not the error, for each measurement. For simplicity, we also use a 
dimensionless number h instead of Hq, where Hq = 100/ikms~^Mpc~^. 

Aside from some that restate previous results, most values on Huchra's list are measurements 
that are either from different observations (different raw data) , different data processings (including 
calibration and correction), or different methods (different relation between distance and observ- 
able), and may include different biases. Each of these has an associated errojf], any of which may 
make the final Hq value differ significantly. There are examples where the same observations and 
the same estimation technique results in differences as large as two standard deviations (see, for 
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example, Rowan-Robinson 2009). The complexity of error sources, and the difference in systematic 
errors estimated by different workers in the field, make it a worthy goal to use median statistics to 
derive a summary estimate of if„ from Huchra-=1M@ 

All but one measurement in Huchra's list have a primary type label that indicates method used, 
and less than half of them also have a secondary type label that indicates the "research group" 
involved. For four measurements in the list the type labels in Huchra's list file look ambiguous. 
We picked their type labels according to our understanding of both the corresponding references 
as well as Huchra's definition of types. (The revised list file is available upon request.) The 
primary type provide a simple, but quite likely typical (GOl), criteria for the subgroup study. For 
conciseness, we focus on the primary type classification in the text, mentioning secondary type 
results (shown along with the primary type results in Table 1 and Fig. 1) only when necessary. 
More sophisticated classification schemes require a careful analysis of the systematic effects, which 
is beyond the purview of this paper. 



3.2. Analysis of the complete list 

A median statistics analysis of the 553 Hq values results in a median h = 0.68 and 95% 
statistical confidence limits of 0.67 < h < 0.69. Ideally, h = 0.68 it 0.01 can be quoted as the 
expected value and corresponding 2cr error. However, caution is in order when quoting these 
because the two requirements of median statistics are very likely not fully met by this Hq list. 
There are two main concerns here: systematic errors that are shared by some of the measurements 
(those in a subgroup) and the restating of prior results (in proceedings and summary papers) which 
result in 'restating' correlations. Both of these effects make measurements in the list statistically 
dependent. Since we do not make use of the error information from the list, we will refer to these 
effects as (subgroup) systematic errors. As mentioned in Sec. 2, we can only check the reliability 
of the above results by studying the effects of subgroup systematics, since we choose to ignore the 
individual errors. 

We first group the 553 measurements into 18 subgroups according to the primary type label 
value in Huchra's list file. The size of each subgroup and the corresponding median statistics 
results are shown in Table 1, while Fig. 1 shows the histograms for all but the two smallest primary 
type subgroups. Notice that the subgroup medians are different, and many differences between 
two subgroups are larger than half of the 95% confidence range of either group. It is fair to say 
that most subgroups have a 'subgroup systematic' error which is close to their median minus the 
true value, and, within each subgroup, statistical errors result in different measurements having 
different values. Clearly, systematic errors for different subgroups have different signs and different 



Other techniques for analyzing heterogeneous coUections of measurements, with possibly different systematic 
errors, can be found in, e.g., Press (1997), Bayesian method; Podariu et al. (2001) and Tammann et al. (2008), error 
weighted averaging; and, Freedman & Madore (2010), both Bayesian and frequentist methods. 
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values. Following GOl, from the line labelled 'Subgroup medians' in the Table, we see that ±5.5 
kms~^Mpc~^ is a reasonable estimate of the ±95% systematic errors. Furthermore, considering 
the debates about systematic errors in this field, it is possible that the subgroup systematic errors 
are complex enough that we can consider them as pseudo-random errors at the level of the whole 
list (GOl). In this case we can use these 18 subgroup medians to estimate the overall uncertaintylll 
The result is h = 0.68 ± 0.055 (95% total error) Ij This may be quoted as a conservative constraint 
on the Hubble constant since we are pretty sure that the 18 measurements (i.e., the subgroup 
medians) are statistically independent. 

Now the 'All data' result has an extremely small uncertainty (95% confidence level) range of 
2 kms~^Mpc~^, while the 'Subgroup medians' has a relatively large one, 11 kms~^Mpc~^. But 
since the number of measurements (A^) affects the uncertainty estimate in a manner similar to the 
1/y/N factor in mean statistics (GOl), we can not simply conclude that the larger error estimate 
includes more uncertainty information (the so-called systematic errors, GOl) than the smaller one. 
To examine the effect of subgroup size on the uncertainty estimate we perform a simulation. The 
simulation randomly regroups the 553 measurements into 18 subgroups of the same sizes listed 
in Table 1, and computes the median statistics of the subgroup medians of these new subgroups. 
Regrouping 100 times results in 100 sets of median statistics values (each set consists of the median 
of 18 subgroup medians and corresponding 95% confidence limits). The median of the 100 95% 
c.l. ranges is 5 kms~^Mpc~^, with the largest one equal to 10 kms~^Mpc~^. So we see that the 
uncertainty estimates from the 18 group medians in Table 1, 11 kms~^Mpc~^, does indeed include 
a source of systematic error. This confirms that it is reasonable to use ±0.055 as the 95% total 
error. 



3.3. Analysis of subsamples of the list 

One concern regarding the above results is that the median of 'All data', h = 0.68, may 
be effected by subgroup systematics. To check this, we perform a median statistics analysis of 
truncated lists of measurements, truncated by excluding one subgroup of measurements at a time. 



^This is qualitatively different from the procedure of GOl where the subgroup medians are used to estimate the 
overall systematic uncertainty. Here we think that the median statistics of the subgroup medians will give the 
overall uncertainty including both the systematic uncertainty and the statistical uncertainty. However, since the 
statistical error here is significantly smaller than that determined in GOl (from a smaller set of measurements) while 
the systematic error has not changed significantly, resulting in the systematic error becoming even more dominant, 
the procedure adopted by us does not result in a quantitatively different total error bar compared to what the GOl 
prescription would give. 

*As an alternate estimate, for those concerned about the reliability of early measurements, we can estimate a 
systematic uncertainty by using the latest measurement of each primary type. These are listed in the Table 1 column 
labelled 'Newest', and the corresponding results are listed in the row labelled 'Newest values', with a 2a total error 
of ±6.5 kms-^Mpc-\ 
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We only exclude the largest subgroups, since excluding a subgroup with only a few measurements 
does not result in a discernible change. The results are shown in the right hand part of Table 1. 
We see that excluding any single subgroup docs not significantly alter the median and c.l. range, at 
least in comparison to the 95% total error above. We ignore the 'No 2nd type' results here because 
it is not really a type and also has too many measurements included. The only suspicious cases 
are that excluding the 'Global Summary' set, where the 95% c.l. range expands most, from 2 to 4 
kms~^Mpc~^, and that excluding the 'Sandage' set, where the median changes most, from 68 to 
70 kms-^Mpc-^ 

The two subgroups picked out above have relatively smaller 95% c.l. ranges. Another subgroup 
that has a similar small 95% c.l. range is the 'Key Project' type. We also point out here that 
the 'Global Summary' type includes results from many summary papers, and is likely the main 
contributor to 'restating' correlations. If we look at the histograms of the subgroups (Fig. 1), we see 
that, except for these three subgroups, the scatter within each subgroup is pretty large compared to 
2 kms~^Mpc~^, the 95% range for the 553 measurements, even after considering the approximate 
1/\/N factor effect. That may explain why none of these subgroups affect the median of 'All data' 
significantly. As a further check, we construct a subsamplc that contains all the measurements 
except those belonging to either the primary type 'Global Summary' or the secondary type 'Key 
project' or 'Sandage'. There are 362 measurements in this subsample and the median and 95% 
confidence limits are 68 and 66 ^ 69 kms^^Mpc"^. 

Another consistency check is a "historical" analysis (GOl). Here we consider two subsamples. 
One, 'HST era' set, only includes the 367 measurements post 1996, the other, 'post GOl' set, only 
includes the 196 measurements added to Huchra's list after GOl. The corresponding results are 67 
and 65 ~ 69 kms~^Mpc~^, and 69 and 67 ~ 70 kms~^Mpc~^, respectively. Note that there are 
13 papers in Huchra's list added after GOl, although they predate GOl. These are not included in 
the 'post GOl'. As a reference, we also compute for two more subsamples, 'pre-HST' and 'pre-GOl', 
that are the complements of the above subsamples respectively. The pre-HST set gives 71 and 
67 ~ 75 kms~^Mpc~^, while the pre-GOl set gives 67 and 65 ~ 69 kms~^Mpc~^, identical to the 
GOl result. 

As a consequence of these consistency checks, we believe that the 'All data' median and the 
95% c.l. range of 'subgroup medians' are fairly robust and together provide a reasonable summary 
estimate of Hq. 

4. Conclusion 

We use median statistics to study Huchra's list of 553 Hubble constant measurements. Ignoring 

the errors associated with individual measurements, and assuming there is no systematic error at 
the whole list level, wc determine a constraint on Hq. We use the median of the complete list and 
estimate the error by only sampling one value, the median, from every primary type of subgroup, in 
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an attempt to eliminate any possible correlations. This constraint is i^o = 68± 5.5 kms~ Mpc~ , 
where the 95% error bar includes both systematic and statistical errors. By studying various data 
subsets, we argue that this result is robust and so should be used as a summary estimate of Hq. 

However, without diving into the detailed systematics of each measurement, the statistical 
independence required by the median statistics technique can not be conclusively established for 
the 'All data' set. Nevertheless, given the complexity of the systematic errors associated with 
measuring distances (as evidenced by the heated debates about them), we believe that the above 
constraint is a reasonable summary value. It is probably significant that this lies in the middle of 
the 'low' Tammann et al. (2008) value of Ho = 62.3 ± 1.3 kms~^Mpc~^ and the 'high' Preedman 
& Madore (2010) value of Ho = 73± 4.5 kms~^Mpc~^ (both la errors). 

We thank J. Huchra for his list of Hq measurements. We acknowledge helpful discussions with 
R. Gott and M. Vogeley and financial support from DOE grant DE-FG03-99EP41093, NSFC grant 
10903006, and National Basic Research Program of China (973 Program) grant No.2009CB24901. 
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Table 1. Hubble Constant Medians (in kms ^Mpc ^) by Type 
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'^We only show the c.l. for subgroups with more than 10 measurements because the c.l. for a smaller subgroup 
is not statistically reliable. The range is defined as the difference between the upper and lower limits. 
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Fig. 1. — Distributions of Hubble constant h measurements of the 16 largest (of 18) primary 
and (last column) 4 largest (of 7) secondary type subgroups. Each heading lists the primary or 
secondary type and the number of measurements in the subgroup is shown in the upper right hand 
corner of each panel. Only measurements in the (horizontal axes) range 0.4 < h < 1 are shown, 
but this restriction does not change the impression about the distribution of each subgroup. 
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